CODEBOOK

-------------------------------------------

"All the News that’s Fit to Fabricate: AI-Generated Text as a Tool of Media Misinformation"
Sarah Kreps, Miles McCain, and Miles Brundage 
August 2020

-------------------------------------------

Contents 
* Experiment One 
* Experiment Two 
* Experiment Three 

Full details of the survey treatments and responses are available in the Appendix.

Unless otherwise defined, variable labels "fall back" to previous experiments -- in other words,
if a variable name is present in the data for experiment two and three but is not explicitly defined in the
codebook, then its label is the same as it was in experiment one. (`education`'s label, for example, is only
defined in experiment one, but is present in both experiments two and three, despite not having an explicit
label.)
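The fallback rule above can be sketched as a small lookup. This is a minimal illustration, not part of the replication materials: the `LABELS` table and the `resolve_label` helper are hypothetical names, only a few labels are shown, and the sketch assumes "fall back" means searching the requested experiment first and then each earlier experiment in turn.

```python
# Hypothetical, abbreviated label tables keyed by experiment number.
# The label text is copied from this codebook; the structure is an assumption.
LABELS = {
    1: {
        "education": "respondent's highest level of education",
        "news": "frequency with which respondents get their news about politics",
    },
    2: {
        "share": "whether the respondent would share this story on social media",
    },
    3: {
        "fake": "whether the respondent says they have ever seen fake news",
    },
}


def resolve_label(variable, experiment, labels=LABELS):
    """Return the label defined for `experiment`, else fall back to earlier ones."""
    for exp in range(experiment, 0, -1):  # e.g. 3, 2, 1
        if variable in labels.get(exp, {}):
            return labels[exp][variable]
    raise KeyError(f"no label defined for {variable!r} in experiments 1-{experiment}")
```

For example, `resolve_label("news", 3)` finds no entry under experiments three or two and returns the experiment one label.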

-------------------------------------------

EXPERIMENT ONE

Dependent Variables 

* credible: belief that content is credible (1=very credible; 2=somewhat credible; 3=not very credible; 4=not at all credible) 
* credbi: Binary measure of belief that content is credible (0=not credible; 1=credible) 
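The relationship between `credible` and `credbi` might look like the sketch below. The codebook does not state the cut point, so the split used here (1 and 2 count as credible; 3 and 4 count as not credible) is an assumption, and `credible_to_binary` is a hypothetical helper name.

```python
def credible_to_binary(credible):
    """Collapse the 4-point `credible` scale into a 0/1 indicator.

    Assumption (not stated in the codebook): "very credible" (1) and
    "somewhat credible" (2) map to 1; "not very credible" (3) and
    "not at all credible" (4) map to 0.
    """
    if credible not in (1, 2, 3, 4):
        raise ValueError(f"credible must be 1-4, got {credible!r}")
    return 1 if credible <= 2 else 0
```

Note that the direction flips: lower values of `credible` mean more credible, while `credbi` uses 1 for credible.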

Treatment Variables (355M version) 

* treats: the treatment shown (1=New York Times baseline; 2=Two paragraph input to GPT-2 (1); 3=Two paragraph input to GPT-2 (2); 4=Two paragraph input to GPT-2 (3))

Treatment Variables (774M and 1.5B versions)

* treats: the treatment shown (1=New York Times baseline; 2=One paragraph input (1); 3=One paragraph input (2); 4=Two paragraph input (1); 5=Two paragraph input (2))

Demographic Variables 

* education: respondent's highest level of education (1=Less than high school; 2=High school-GED; 3=Some college; 4=2-year college degree; 5=4-year college degree; 6=Post-graduate degree (MA, MBA, MD, JD, PhD)) 
* college: college degree or higher (1=yes, 0=no)
* income: respondent's income (1=Less than 5,000; 2=5,000-7,499; 3=7,500-9,999; 4=10,000-12,499; 5=12,500-14,999; 6=15,000-19,999; 7=20,000-24,999; 8=25,000-29,999; 9=30,000-34,999; 10=35,000-39,999; 11=40,000-49,999; 12=50,000-59,999; 13=60,000-74,999; 14=75,000-84,999; 15=85,000-99,999; 16=100,000-124,999; 17=125,000-149,999; 18=150,000-174,999; 19=175,000+; 20=prefer not to say)
* race: respondent's race (1=White; 2=Black or African-American; 3=Hispanic or Latino; 4=Asian or Asian American; 5=Native American; 6=Mixed race; 7=Middle Eastern; 8=Other)
* party: respondent's political party (1=Strong Democrat; 2=Not very strong Democrat; 3=Independent (close to Democrat); 4=Independent (close to neither); 5=Independent (close to Republican); 6=Not very strong Republican; 7=Strong Republican)
* gender: respondent's gender (1=Male; 2=Female) 
* hawk1: respondent's response to "Some people think that US military force should never be used under any circumstances. They are at “1” on the scale below. Other people think there are many situations in which US military force should be used to deal with problems. They are at “7” on the scale below." (1-7 scale)
* hawk2: respondent's response to "The US needs to play an active role in solving conflicts around the world." (1=Strongly agree; 2=Somewhat agree; 3=Somewhat disagree; 4=Strongly disagree) 
* nkthreat1: respondent's response to "Do you see North Korea’s development of nuclear weapons as a critical threat to vital US interests, an important but not critical threat to vital US interests, or not at all an important threat to vital US interests?" (1=Critical; 2=Important but not critical; 3=Not important at all)
* nkthreat2: respondent's response to "What is your overall opinion of North Korea? Is it very favorable, mostly favorable, mostly unfavorable, or very unfavorable?" (1=Very favorable; 2=Mostly favorable; 3=Mostly unfavorable; 4=Very unfavorable)
* news: frequency with which respondents get their news about politics from TV, radio, newspaper, or online outlets. (1=a couple of times a month or less; 2=once a week; 3=2-3 times a week; 4=daily; 5=several times a day)
* partyid: a variable derived from `party` (1=democrat, all levels; 2=neither democrat nor republican (independent); 3=republican, all levels)
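A possible derivation of `partyid` from the seven-point experiment one `party` scale is sketched below. The codebook does not say whether independents who lean toward a party (values 3 and 5) are grouped with that party or with independents, so the sketch exposes that choice as a hypothetical `leaners_as_partisans` flag; the function name is also an assumption. This applies only to the experiment one coding, since experiment two codes `party` differently.

```python
def party_to_partyid(party, leaners_as_partisans=False):
    """Collapse the 7-point experiment one `party` scale into `partyid`.

    Returns 1 (Democrat), 2 (independent), or 3 (Republican).
    Assumption: by default only values 1-2 are Democrat and 6-7 are
    Republican; with leaners_as_partisans=True, 3 and 5 join the parties.
    """
    if party not in range(1, 8):
        raise ValueError(f"party must be 1-7, got {party!r}")
    dem_max, rep_min = (3, 5) if leaners_as_partisans else (2, 6)
    if party <= dem_max:
        return 1
    if party >= rep_min:
        return 3
    return 2
```

Comparing the output of both settings against the released `partyid` column would show which grouping the data actually uses.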

-------------------------------------------

EXPERIMENT TWO

* govaware: respondent's response to "Some people don’t pay much attention to news about politics and government. How about you? Would you say that you are very much interested, somewhat interested, not much interested, not at all interested?" (1=very much interested; 2=somewhat interested; 3=not much interested; 4=not at all interested)
* ideology: respondent's response to "When it comes to your own politics, would you say that you are very conservative, somewhat conservative, moderate, somewhat liberal, or very liberal?" (1=very conservative; 2=somewhat conservative; 3=moderate; 4=somewhat liberal; 5=very liberal)
* party: respondent's response to "Generally speaking, with which of these political parties do you most closely identify?" (1=Republican; 2=Democrat; 3=Green; 4=Libertarian; 5=No political party) 
* education: respondent's highest level of education (1=Less than high school; 2=High school-GED; 3=Some college; 4=2-year college degree; 5=4-year college degree; 6=Post-graduate degree (MA, MBA, MD, JD, PhD)) 
* college: college degree or higher (1=yes, 0=no)
* income: respondent's income (1=Less than 5,000; 2=5,000-7,499; 3=7,500-9,999; 4=10,000-12,499; 5=12,500-14,999; 6=15,000-19,999; 7=20,000-24,999; 8=25,000-29,999; 9=30,000-34,999; 10=35,000-39,999; 11=40,000-49,999; 12=50,000-59,999; 13=60,000-74,999; 14=75,000-84,999; 15=85,000-99,999; 16=100,000-124,999; 17=125,000-149,999; 18=150,000-174,999; 19=175,000+; 20=prefer not to say)
* gender: respondent's gender (1=Male; 2=Female) 
* imminc: respondent's response to "Thinking now about immigrants—that is, people who come from other countries to live here in the United States—in your view, should immigration be kept at its present level, increased, or decreased?" (1=increased considerably; 2=increased somewhat; 3=present level; 4=decreased somewhat; 5=decreased considerably) 
* credible: respondent's belief that the treatment article was credible (1=very credible; 2=somewhat credible; 3=not very credible; 4=not at all credible)
* share: whether the respondent would share this story on social media (1=yes; 2=no)
* wall: whether the respondent favors or opposes the construction of walls along the US-Mexico border (1=strongly favor; 2=favor; 3=oppose; 4=strongly oppose)
* trump: whether the respondent approves or disapproves of Trump's handling of his job (1=strongly approve; 2=somewhat approve; 3=somewhat disapprove; 4=strongly disapprove)

-------------------------------------------

EXPERIMENT THREE

* hawk1: respondent's response to "Some people think that US military force should never be used under any circumstances. They are at “1” on the scale below. Other people think there are many situations in which US military force should be used to deal with problems. They are at “7” on the scale below." (1-7 scale)
* hawk2: respondent's response to "The US needs to play an active role in solving conflicts around the world." (1=Strongly agree; 2=Somewhat agree; 3=Somewhat disagree; 4=Strongly disagree) 
* ideology: respondent's response to "When it comes to your own politics, would you say that you are very conservative, somewhat conservative, moderate, somewhat liberal, or very liberal?" (1=very conservative; 2=somewhat conservative; 3=moderate; 4=somewhat liberal; 5=very liberal)
* credible: respondent's belief that the treatment article was credible (1=very credible; 2=somewhat credible; 3=not very credible; 4=not at all credible)
* believable: respondent's belief that the treatment article was believable (1=very believable; 2=somewhat believable; 3=not very believable; 4=not at all believable)
* accurate: respondent's belief that the treatment article was accurate (1=very accurate; 2=somewhat accurate; 3=not very accurate; 4=not at all accurate)
* clear: respondent's belief that the treatment article clearly explains the event in question (1=very clearly; 2=somewhat clearly; 3=not very clearly; 4=not at all clearly)
* share: whether the respondent would share this story on social media (1=yes; 2=no)
* fake: whether the respondent says they have ever seen fake news (1=definitely no; 2=possibly no; 3=possibly yes; 4=definitely yes)
 